Role: Big Data Engineer
Relationships
Primary Performs
I10A.1 Deploy cluster and nodes
I10A.2 Implement storage system
I10A.3 Implement processing engine
I10A.4 Implement communication platform
I10A.5 Implement resource management
I11.1 Identify data consumers
I11.2 Implement interfaces with data consumers
I4.2 Implement interfaces with data sources
I5.1 Define the data lake
I5.2 Implement the Collector component
I7.2 Implement the algorithms
O15.1 Short term containment
O15.2 System check
O15.3 Long term containment
O15.4 Recovery
Additionally Performs
I10B.1 Decide a commercial virtual solution
I10B.2 Configure the commercial virtual solution
I4.1 Identify data sources
I5.3 Implement security solutions for the Collector
I6.1 Identify the important information from data
I6.2 Define scripts to prepare the data
I6.3 Implement security solutions for the preparator
I7.1 Design the algorithms for the data analysis
I7.3 Implement security solutions for the Analyzer
I8.1 Decide the best way to visualize the information
I8.2 Implement visualization techniques
I8.3 Implement security solutions for the Viewer
I9.1 Define access control rules
I9.2 Implement access control rules
O15.5 Visualization of the status of the Big Data ecosystem
Modifies
Big Data algorithms
Big Data Assets
Communication platform
Data consumers
Data Lake
DC interfaces
DP interfaces
List of actions taken
List of primary actions taken
Processing engine implementation
Registry of expected information
Registry of hardware resources
Registry of software
Resource management
Status of the Big Data ecosystem
Storage system
Main Description
This role aims to create and manage a company's Big Data ecosystem. To do this, the person who has this role must have knowledge about Big Data infrastructure and tools.